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Abstract 

Background: The soft rot oscomycetol fungus Trichodermo reesei is utilized for industrial production of secreted 
enzymes, especially lignocellulose degrading enzymes. T. reesei uses several different enzymes for the degradation 
of plant cell wall-derived material, including 9 characterized cellulases, 15 characterized hemicellulases and at least 
42 genes predicted to encode cellulolytic or hemicellulolytic activities. Production of cellulases and hemicellulases is 
modulated by environmental and physiological conditions. Several regulators affecting the expression of cellulase 
and hemicellulase genes have been identified but more factors still unknown are believed to be present in the 
genome of T. reesei. 

Results: We have used transcriptional profiling data from T. reesei cultures in which cellulase/hemicellulase 
production was induced by the addition of different lignocellulose-derived materials to identify putative novel 
regulators for cellulase and hemicellulase genes. Based on this induction data, supplemented with other published 
genome-wide data on different protein production conditions, 28 candidate regulatory genes were selected for 
further studies and they were overexpressed in T. reesei. Overexpression of seven genes led to at least 1.5-fold 
increased production of cellulase and/or xylanase activity in the modified strains as compared to the parental strain. 
Deletion of gene 77513, here designated as ace3, was found to be detrimental for cellulase production and for the 
expression of several cellulase genes studied. This deletion also significantly reduced xylanase activity and expression of 
xylan-degrading enzyme genes. Furthermore, our data revealed the presence of co-regulated chromosomal regions 
containing carbohydrate-active enzyme genes and candidate regulatory genes. 

Conclusions: Transcriptional profiling results from glycoside hydrolase induction experiments combined with a 
previous study of specific protein production conditions was shown to be an effective method for finding novel 
candidate regulatory genes affecting the production of cellulases and hemicellulases. Recombinant strains with 
improved cellulase and/or xylanase production properties were constructed, and a gene essential for cellulase gene 
expression was found. In addition, more evidence was gained on the chromatin level regional regulation of 
carbohydrate-active enzyme gene expression. 
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Transcriptional profiling 



* Correspondence: mari.hakkinen@vtt.fi 

\^ Technical Research Centre of Finland, PO Box 1000 Tietotie 2, Espoo 
FI-02044, Vn, Finland 

O© 2014 Hakkinen et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative 
BIoIVIGCI CGntrsI commons Attribution License (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and 
reproduction in any medium, provided the original work is properly credited. 



Hakkinen et al. Biotechnology for Biofuels 2014, 7:14 
http://www.biotechnologyforbiofuels.conn/content/7/1/14 



Page 2 of 21 



Background 

Plant biomass, consisting mostly of cellulose, hemicellu- 
lose and lignin, is the most abundant renewable energy 
source on earth. Degradation of the biomass and continu- 
ation of the carbon cycle is maintained mainly by micro- 
bial action, especially by fungi of different species. The 
biomass-degrading enzymes produced by these organisms 
also have applications in different fields of industry, in- 
cluding biorefinery applications [1]. Trichoderma reesei 
(an anamorph of Hypocrea jecorina) is an extremely effi- 
cient producer of cellulose- and hemicellulose-degrading 
enzymes, and is therefore widely employed by the enzyme 
industry for the production of its own enzymes as well as 
for producing proteins from other sources [2,3]. The gen- 
ome of T, reesei encodes nine characterized cellulase en- 
zymes and 15 characterized hemicellulase enzymes. In 
addition, a large number of genes encoding candidate 
carbohydrate-active enzymes (CAZy) [4,5] have been iden- 
tified from the genome [6,7]. According to an updated an- 
notation, the genome encodes 201 glycoside hydrolase 
genes, 22 carbohydrate esterase genes and 5 polysaccharide 
lyase genes, of which at least 66 are known or predicted 
to encode cellulolytic and hemicellulolytic activities [8]. 

Energy efficient production of cellulases and hemicel- 
lulases is achieved by tight gene regulation governed by 
inducer-dependent expression of the genes and by re- 
pression of the genes in the presence of fast metabolized 
carbon sources (for reviews see [9,10]). In addition to 
the type of carbon source, additional environmental con- 
ditions are known to affect protein production together 
with the physiological state of the cells, such as pH [11], 
light [12], the specific growth rate and cell density of the 
fungus [13,14], and the physiological state of the mito- 
chondria [15]. Furthermore, the expression of many cel- 
lulase and hemicellulase genes is shown to be under a 
feedback regulation mechanism that functions under 
conditions in which the capacity of the cells to fold and 
secrete proteins is limited and transcriptional down- 
regulation is required to reduce the amount of secreted 
protein produced [16]. 

The variety of environmental and physiological factors 
affecting the enzyme production of T, reesei infers that a 
complex signaling cascade and regulatory network is 
needed for the accurate timing of hydrolytic enzyme 
production. Several regulatory factors for cellulase and 
hemicellulase genes have been characterized, the most 
extensively studied of which are the transcription factor 
CREl, which mediates carbon catabolite repression [17], 
and the major regulator needed for expression, XYRl 
[18]. Other characterized factors are the positively acting 
ACE2 [19] and HAP2/3/5 complex [20], and the nega- 
tively acting factor ACEl [21,22]. Recently, novel factors 
possibly affecting the regulation of genes encoding 
hydrolytic enzymes have been found from Trichoderma 



and other fungi. F-box proteins that have been suggested 
to be involved in the regulation of plant cell wall- 
degrading enzymes have been identified from Aspergillus 
and Fusarium [23,24] . Two putative regulators of cellulase 
and hemicellulase genes named CLR-1 and CLR-2 have 
been identified from Neurospora crassa [25] and a tran- 
scription factor BglR has been suggested to regulate p- 
glucosidase genes of T, reesei [26]. Another recent finding 
is that the putative methyltransferase LAEl is essential for 
the formation of T, reesei cellulases and hemicellulases, al- 
though the precise mechanism is still unclear [27]. In the 
light of recent findings from Trichoderma and other fungi, 
it can be assumed that not all regulatory factors have been 
identified yet and that additional regulatory genes can still 
be found in the genome of T reesei. 

In this study, transcriptional profiling data from T, 
reesei cultivated in the presence of several lignocellulose 
substrates as well as other genome-wide data from dif- 
ferent types of protein production conditions were used 
to identify putative regulators for cellulase and hemicel- 
lulase genes. Several candidate regulatory genes were 
identified, and shown to have an effect on cellulase and 
hemicellulase production when overexpressed in T, reesei. 
Furthermore, the genomic context of the CAZy genes and 
co-regulated candidate regulatory genes were analyzed. 
The data revealed co -regulated regions containing candi- 
date regulatory genes and CAZy genes, as well as other 
genes relevant for the utilization of the carbon source, 
such as transporter genes. The relevance of the regions is 
discussed in the paper. 

Results 

Analysis of transcriptome data to identify candidates for 
regulators of cellulase and hemicellulase genes 

Transcriptome analysis has previously been carried out 
to study the expression of CAZy genes in T, reesei cul- 
tures that were induced by the addition of different types 
of lignocellulose material, purified carbohydrate poly- 
mers or disaccharides (Avicel cellulose, pretreated wheat 
straw, pretreated spruce or sophorose) [8]. In the present 
study, data from the previous work were further ana- 
lyzed and explored to identify candidate regulators for 
CAZy genes and, in particular, for cellulase and hemicel- 
lulase genes. The expression data were clustered using 
Mfuzz [28,29] to reveal groups of co-regulated genes. 
The majority of the genes encoding characterized en- 
zymes and accessory factors involved in lignocellulose 
degradation were found in two clusters. Cluster 10 con- 
tained the major cellulase and p-glucosidase genes 
{cbhl, cbh2, egll, egl2, egl3, eglS, bgll and bgl2) together 
with a set of hemicellulase genes {abfl, bgal, cip2, 
cel74a and xyn3). Cluster 35 contained predominantly 
hemicellulase genes {agll, aglS, manl, aesl, axel, bxll, 
girl, xynl and xyn4) (Figure 1; for gene names, see [8]). 
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Figure 1 Expression profiles of the clusters from Mfuzz clustering containing the majority of the cellulase and hemicellulase genes. 

The expression array dataset on T. reesei cultures induced witli Avicel cellulose, pretreated wheat, pretreated spruce or sophorose (described in 
[8]) were clustered using Mfuzz. AV, Avicel cellulose; CO, control cultivation; SO, sophorose; SP, spruce; WH, wheat straw. 

V J 



Only a few characterized hemicellulase genes were found 
outside these clusters (a-galactosidase genes 1 and 2, 
and xylanase 2 gene). A large number of putative regula- 
tory genes clustered together with the known cellulase 
and hemicellulase genes. In particular, many genes en- 
coding putative fungal C6 zinc finger-type transcription 
factors (containing InterPro domains IPR001138 fungal 
transcriptional regulatory protein, N-terminal and/or 
IPR007219 transcription factor, fungi [30]) were enriched 
within the clusters {P = 0.00027). Within the clusters, 
5.9% of the genes encoded the predicted fungal type 
transcription factors, whereas only 2.5% of the total gen- 
ome content belonged to this class. In addition, the clus- 
ters contained genes encoding candidates for other types 
of Zinc finger proteins, kinases and proteins involved in 
chromatin remodeling or organization, as well as pro- 
teins with InterPro domains indicating different regula- 
tory or signal transduction functions (for the classes of 
the genes, see Table 1). A few known regulators were 
among the co-expressed genes, such as xyrl, the major 
regulator for cellulase and hemicellulase expression [18], 
and the homologues for N. crassa clr-2 [25], Aspergillus 
nidulans creC [31] and Fusarium oxysporum frpl [24]. 

To cover putative regulatory genes induced by the 
substrates but showing different temporal patterns and 
extent of induction (and therefore not clustered together 
with the characterized cellulase and hemicellulase genes), 
the differentially expressed genes at each of the time 
points were identified by comparing the expression level 
in the induced cultures to the level in the uninduced con- 
trol cultures (using Limma package (R, Bioconductor) 
[28,32], and the cut-off P <0.01 in the statistical analysis). 
Altogether, 89 genes with putative regulatory functions 
were either co-clustered with the characterized cellulase 
and hemicellulase genes or showed increased signal level 
in most of the inducing conditions studied. 

In order to get further support for the relevance of 
the 89 candidate genes in cellulase and hemicellulase 



production and to narrow down the number of genes to 
be selected for further studies, the expression of the can- 
didate genes was compared in additional datasets on dif- 
ferent protein production conditions. Transcriptome and 
proteome data from chemostat cultures with different 
specific growth rate, cell density and specific protein 
production rate [14] were explored for expression of the 
candidate genes and production of the corresponding 
proteins. Expression of 14 candidate genes showed either 
positive or negative correlation (absolute value <0.5) to 
the specific protein production rate in the chemostat cul- 
tures. Proteome analysis of the same cultures [14] showed 
that the candidate GCN5-related N-acetyltransferase 
(123668) was more abundant in the cultures with higher 
protein (and cellulase) production level, whereas the 
SEC14-domain protein 81972 and the candidate GCN5- 
related N-acetyltransferase (120120) were more abundant 
in the cultures with low protein (cellulase) production. 
The results are in accordance with the positive and nega- 
tive correlation of the expression of genes 123668 and 
81972 with the specific protein production rate, respect- 
ively. Gene 120120 showed a slightly negative correlation 
with the specific protein production rate. 

The CAZy genes are not randomly positioned in the 
genome. It has been reported that 41% of CAZy genes 
are found in 25 discrete regions ranging from 14 kb to 
275 kb in length, and cases of co-expressed adjacent or 
nearly adjacent genes have been shown [7]. The regions 
of high CAZy gene density were found to contain genes 
encoding proteins involved in secondary metabolism. 
Our study also revealed the presence of regulatory genes 
in close vicinity to CAZy genes. In some cases, co- 
expression of these regulatory genes with CAZy genes was 
also detected. This information was used in the selection 
of candidate regulatory genes for further studies. For ex- 
ample, genes 76677 and 121130 are located in a broad, 
partly co-regulated region containing several CAZy genes. 
These genes include a candidate GH27 a-galactosidase 
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Table 1 Classes, functional domains and domain descriptions of the candidate regulators encoded by the genes that 
are co-regulated with cellulase and/or hemicellulase genes 



Class 


InterPro 


Domain description 


Candidate 




domain 




regulatory 










Fungal transcription factors (Zn2-C6 type) 


IPRDDI 1 
1 r nUU 1 1 JO 


ruiiydl LI dl loLI ipLIUI Idl IcyuldLUly piULclll, l\l Lcllllllldl 


31 


IPRnn771 Q 
Ir nUU/ Z 1 y 


1 Idl IbLI ipLIUI 1 IdLLUI, lUIILjl 




1 1 ell loLI ipLIUI 1 IdLLUlo, UdolL IcULIIIt; Zipptrl Lyptr 


1 r nUU^OZ / 


Raci(^-loi ii^ino vipiPior tranci^ripitipin factor 
DdolL IcULIIIc Zipptrl LI dl loLI ipLIUI 1 IdLLUI 




IPRDI 1 fil 
1 r nu 1 1 u 1 U 


Racir"-loi ir"ino vipiPior trancrripitipin factor K7IP-1 
DdolL IcULIIIc Zippcl LI dl loLI ipLIUI 1 IdLLUI, UZ_ir 1 




Transcription factor, Tcf25 type repressor 


IPRnnfiQQ4 


1 1 dl loLI IjJLIUI 1 IdLLUI ZJ 




Zinc finger, C2H2 type 


1 r r\uu / uo / 


7inr finnpr (^71— l7-t\/np 
z_ii IL iiiiLjci, L-Znz LyjJc 


2 


Zinc finger, otiier types 


ir nuuzo^j 


Z_ll IL IIIILjcl, IVIiInL^ Lyptr 




1 r nUUUU JO 


z_ll IL llliycl, AIM 1 Lyptr 


] 


ir nuuzou/ 


Z_ll IL llliycl, L-UnL. Lyptr 


] 


IPRDDRQI ^ 


Z_ll IL llliytrl, L.n 1 Lyptr 


] 


1 r r\uuu-j / 1 


7inr finnpr r~r~r~l— l-t\/r^p 
z_ii IL llliytrl, L.L.L.n Lyptr 


] 


Clironnatin level regulation/remodeling 


IPRDI ^7^6 
Ir nU 1 jZjO 


rhrnmatin ^PT7 
L.I II Ul 1 Id Lll 1 2>r 1 Z 


] 


1 r fxKJKJKJyDD 


L.IIIUIIIU UUIIIdlll 




IPRnnR7S1 
irnUUOZj 1 


L.IIIUIIIU ol IdUUVV 




IPRDDDI R7 
1 r nUUU 1 OZ 


r^r~KI '^-rolatoH KI-ar"ot\/ltrancforaco 
LD^INJ ItrldLtrU l\l dLtrLyi LI dl 1 jI tri dotr 


6 


IPRDI ^1 7R 
1 r r\u 1 J 1 / o 


Hi<;tnnp H^-KSf^ ?irpt\/ltr?in<;fpr?i<;p RTTl DQ 
nioLUiitr no r\ju dLtrLyiLidi loitridotr, n i i i Ur' 


] 


IPRDDI 714 
Ir nuu 1 z 1 'H- 


Q PT r\ n m 3 i n 
jC 1 UUI 1 Idll 1 


2 


IPRDDD^^D 


^MP7-rp|pitpH 
jlNirz ItrldLtrU 




IPRDDI D7S 
1 r nuu 1 uz J 


DIUIIIU dUJdLtrllL 1 lUI 1 lUIUyy UUIIIdlll 


] 


IPRDDI 4R7 
ir nuu 1 / 


R m m n H n r~n a i n 
Dl Ul 1 lUUUI 1 Idl 1 1 




IPRDDD71 D 
irnuuuz i u 


RTR/Pn7-lil^P 

D 1 D/ r WZ_ 1 1 Ktr 




Protein kinases 


IPRDDD71 Q 
irnuuu/ 1 y 


Pmfoin l/inaco r7if7i\\if\r Hnmain 
riULtrlll Kllldbtr, LdLdiyLIL UUIIIdlll 




IPRDI 1 DDQ 
1 r nU 1 1 KJKJy 


Pmtoin l/i na co-l i I/O Hnr^nain 
riULtrlll l\llldotr lll\tr UUIIIdlll 




G protein signaling 


IPRDI 1 D71 
1 r nU 1 1 uz 1 


A rrocti n-li I/O KI-torr~ninal 
AlltroLIII lll\tr. In Ltrllllllldl 




IPRDI 1 D77 
1 r nu 1 1 uzz. 


A rroQti n-li I/O r~-torminal 

/xllCoLIII II l\C, v_ LCIIIIIIIdl 




IPRDDDR^7 
ir ruuuojz 


r>m"l"oi n-r"rM ir>loH rocor^fnr famiK/ 7 cor"ro"l"in-lil/o 
Ld piULtrlll LUUpitrU ItrbtrpLUI, Idlllliy Z, btrLltrLIII IIKtr 




IPRDDD^47 
1 r nUUUjT-Z 


Roni ilatnr nf r>mtoin cinnalinn 
ntryuldLUI Ul LD piULtrlll oiyildllliy 


2 


Other regulators 


IPRDDDDQS 


PAK-hnY/P71 -Rhn-hinrlinn 
r Ai\ uuA/ rzi niiu uiiiuiiiy 




IPRDDD^R? 
ir nUUUOO/ 


r^M 3 l-cr^O("ifi("/r^mtoi n-t\/mci no r^hmcr^hataco rr^t^QpiVMPir^ roninn 
L-^Udl bptrLII IL/ pi ULtrll 1 LyiUollltr pi lUbpi Id Ldotr, LUI lotrl VtrU ItryiUII 




IPRDDD7Q1 
1 r nuuu / " 1 


nPRl /Fl IKR4/\/?i?iH 
Lur r\ 1 / 1 u 1 N jt-/ yddi i 


] 


IPRDDQD^y 
irr\uu!7Uj / 


1— 1 n m o n H n m 3 i n - 1 i 1/ o 
nUI 1 ItrUUUI I Idll 1 IIKtr 


] 


IPRDDI (^1 1 
irnuu 1 o II 


1 oi ir"i no-ri("hi ronioat 
LtrULIIItr IILII ItrptrdL 


] 


IPRDDRD^D 
1 r nUUOUOU 


KImrA-lil/o 
1 \l 1 1 1 1 A II l\tr 


] 


IPR008914 


Phosphatidylethanola mine-binding protein 


1 


IPR012093 


Pirin 


2 


IPR011989 


Armadillo-like helical 


2 


IPR001313 


Pumilio RNA-binding repeat 




IPR001251 


CRAL-TRIO domain 


1 


IPR005511 


Senescence marker protein-30 


1 


IPR001810 


F-box domain, cyclin-like 


1 


IPR003892 


Ubiquitin system component Cue 


1 


IPR001680 


WD40 repeat 


4 
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gene (59391), a candidate GH2 |3-mannosidase gene (59689) 
and characterized p-glucosidase {bgll) and p-xylosidase 
(bxll) genes (data not shown). Gene 102499 has an inter- 
esting location between a very tightly co-regulated region 
of CAZy genes and putative secondary metabolism genes 
(Figure 2, region 1). Gene 120120 is located in a co- 
expressed region including four genes of the hemicellulase 
gene-enriched cluster (cluster 35), and close to a second 
co-expressed region containing the candidate regulatory 
genes 74765, 55422 and the repressor gene crel (Figure 2, 
region 2). 



Interestingly, we found several loci where a |3- 
glucosidase and/or putative sugar transporter gene is lo- 
cated next to a gene with a putative regulatory function 
and co-expressed with it. Genes 77513, 105263 and 
121121 are located next to candidate p-glucosidase genes 
cellb, celSe and celSd, respectively. The regions including 
genes 77513, 121121 and 26163 (the closest homologue 
for N, crassa clr-2) contain a putative sugar transporter 
gene (Figure 2). 

The focus in selection of candidate regulatory genes 
for further studies was on the genes encoding putative 



( ^ 



Region 


Gene ID 


Overexpression 
construct 


1 Strand | 


Scaffold no. { 


Start 


End 


Class 


Description 


Mfuzz cluster 


AV 
Oh 


AV 
6h 


1 

AV WH 
17h Oh 


WH 
6h 


1 

WH SP 
17h Oh 


SP 
6h 


1 

SP SO 
17h Oh 


SO 
6h 


SO 
17h 


SO 
41 h 




73618 






1 


482161 


490278 


Secondary metabolism 


Candidate polyketide synthase 


24 


0,7 


-0,4 


2,6 


2,1 


-4,2 -4,5 


1,7 


-3,5 -4,3 


1,1 


0,8 


2,4 


0,6 




73621 




+ 


1 


491045 


499128 


Secondary metabolism 


Candidate polyketide synthase 


24 


0,4 


0,1 


2,3 


1,1 


-2,9 


-4,1 


0,8 


-2,9 


-3,8 


0,7 


1,0 


2,1 


0,6 




73623 




+ 


1 


499862 


501401 


Secondary metabolism 


Candidate flavoprotein monooxygenase 


24 


0,1 


0,2 


2,1 


1,3 


-3,5 -4,8 


1,3 


-2,7 


-3,6 


0,8 


0,9 


1,8 


0,9 




43701 




- 




501681 


503623 


Transport 


Candidate multidrug-resistance transporter, 
MFS 


32 


-0,1 


-0,3 


0,7 


0,4 


-2,0 


-3,6 


-0,1 


-1,8 


-3,1 


0,1 


0,0 


0,6 


0,2 




102497 




+ 




504139 


506467 


Regulatory functions 


Candidate fungal transcriptional regulatory 
protein 


24 


-0,1 
0,1 

0,1 


-0,4 


1,1 


0,0 


-1,5 


-2,5 
-4,0 

1,2 


0,1 


-1,6 


-2,6 
-3,1 

1,1 


0,3 


0,1 


1,1 


0,3 


1. 


73631 








506653 


508537 


Metabolism 


Candidate FAD linked oxidase 


15 


0,1 
1,5 


1,7 
3,4 


1,2 
0,7 


-4,1 
-0,5 


1,2 
0,5 


-2,2 
-0,4 


0,5 
-0,2 


0,8 
1,8 


2,0 
2,7 


0,8 
1,8 




102499 


pMH12 






510531 


512829 


Regulatory functions 


Candidate fungal transcriptional regulatory 
protein 


25 




102500 








518168 


518762 


Secreted 


Hypothetical protein 


44 


-0,4 


0,6 


0,8 


0,4 


2,0 


2,3 


0,8 


0,0 


1,2 


0,5 


-0,1 


-0,2 


-0,5 




73632 








520228 


521337 


Carbohydrate esterase family 


Sxel, acetyl xylan esterase 


35 


-0,9 


3,1 


2,4 


-0,7 


3,6 


3,2 


-1,4 


0,9 


1,7 


-2,1 


0,9 


1,3 


2,2 




73638 








523039 


524217 


Carbohydrate binding, CBM 


cipl 


35 


-0,4 


1,5 


1,3 


0,2 


1,5 


1,3 


-0,1 


0,9 


1,4 


-0,2 


1,2 


1,4 


1,8 




73643 


pMH25 




1 


524767 


526118 


Glycoside hydrolase family 61 


egl4/cel61a, candidate polysaccharide 
monooxygenase/endo-p-1,4-glucanase 


35 


0,0 


1,5 


1,3 


0,8 


2,1 


2,1 


0,3 


1,3 


1,8 


0,0 


0,9 


1,3 


1,4 




74765 




2 


765243 


767029 


Nucleasome 


Candidate BTB/POZ domain protein, possibly 
involved in histone deacetylation 


23 


0,0 


1,8 


1,0 


0,1 


2,0 


1,9 


0,1 


1,4 


1,6 


0,0 


1,4 


0,8 


0,3 




103653 




+ 


2 


767953 


768491 


Hypothetical protein 


Hypothetical protein 


23 


-0,1 


1,5 


0,1 


-0,2 


1,8 


0,5 


-0,3 


0,3 


0,4 


0,1 


0,6 


0,3 


0,3 




55422 


pMH2^ 


+ 


2 


773609 


774500 


Regulation & other 


Candidate kinase domain protein 


23 


-0,1 


0,9 


0,6 


0,2 


1,0 


1,1 


-0,1 


0,7 


1,1 


0,1 


0,8 


0,4 


0,3 




103655 






2 


775052 


776674 


Replication and repair 


Candidate RecA family protein 


44 


0,2 


0,4 


-0,1 


0,6 


0,8 


0,7 


0,4 


0,4 


0,3 


0,1 


0,7 


0,3 


-0,1 


2. 


120117 




+ 


2 


785322 


788379 


Regulatory functions 


crel, carbon catabolite repressor 


31 


0,2 


0,2 


0,1 


0,1 


0,4 


0,1 


0,4 


0,1 


0,3 


0,0 


0,0 


-0,1 


-0,2 




103660 






2 


794133 


795265 


Hypothetical protein 


Hypothetical protein 


35 


0,0 


0,4 


0,0 


0,6 


1,2 


0,5 


0,3 


0,9 


0,7 


0,9 


0,6 


0,9 


0,4 




55790 






2 


798986 


800882 


RNA 


Candidate N2,N2-dimethylguanosine tRNA 
methyltransferase protein 


35 


0,0 


0,5 


0,3 


-0,1 


0,5 


0,9 


-0,1 


0,2 


0,6 


0,1 


0,6 


0,4 


0,2 




55478 




+ 


2 


802255 


802927 


Hypothetical protein 


Hypothetical protein 


35 
35 


0,0 


1,9 


1,4 


0,1 
0,1 


1,4 1,6 


-0,1 


0,5 


0,8 


0,5 


1,3 


1,4 


1,1 
1,1 




120120 


pMH22 




2 


803502 


804334 


Regulatory functions 


Candidate GCN5-related N-acetyltransferase, 
chromatin remodeling 


-0,3 


0,8 


0,8 


1,1 


0,9 


0,0 


0,4 


0,5 


-0,1 


0,5 


0,9 




105263 


pMH16 


+ 


5 


132264 


134116 


Regulation 


Candidate fungal transcriptional regulatory 
protein 


10 


0,1 


0,0 


0,1 


0,2 


0,2 


0,6 


-0,1 


0,2 


0,4 


0,4 


0,3 


0,9 


0,8 


3. 


76227 




+ 


5 


134792 


137417 


Glycoside hydrolase family 3 


cel3e, candidate (S-glucosidase 


10 


0,4 


0,8 


1,2 


0,7 


0,1 


0,6 


0,3 


0,7 


0,6 


0,4 


1,2 


2,1 


2,3 




58701 






5 


137510 


138803 


Metabolism/protein transport 


Candidate hexose-6-phosphate isomerase 


10 


0,3 


0,6 


0,9 


0,2 


0,4 


0,7 


0,0 


0,5 


0,3 


0,0 


0,9 


1,6 


2,1 




76659 






6 


6460 


7804 


Hypothetical protein 


Hypothetical protein 


35 


-0,2 


1,3 


0,7 


1,1 


1,7 


1,2 


0,5 


1,2 


1,3 


-0,5 


1,0 


0,8 


1,8 


4. 


121121 


pMH10 


+ 


6 


10751 


13028 


Regulation 


Candidate fungal transcriptional regulatory 
protein 


35 


-0,1 


1,0 


0,5 


0,3 


1,3 


1,5 


-0,3 


0,7 


1,1 


0,1 


0,5 


1,1 


1,2 




46816 






6 


13220 


15861 


Glycoside hydrolase family 3 


cel3d, candidate p-glucosidase 


35 


0,4 


3,4 


2,5 


1,1 


3,0 


3,1 


0,3 


2,1 


2,4 


0,3 


2,1 


2,8 


2,7 




46819 




+ 


6 


16768 


18679 


Transport 


Candidate sugar transporter, MFS 


10 


-0,3 


3,7 


2,8 


0,0 


2,1 


1,7 


-0,9 


1,5 


1,1 


-0,2 


2,2 


3,1 


3,5 




106164 




+ 


6 


1352069 


1353346 


Metabolism 


Candidate short-chain 
dehydrogenase/reductase, SDR 


35 


0,0 


1,8 


1,2 


0,1 


1,7 


1,8 


0,5 


2,1 


2,0 


0,0 


1,7 


1,4 


1,4 


5. 


3405 




+ 


6 


1355561 


1357332 


Transport 


Candidate sugar transporter, MFS 


10 


-0,1 
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0,2 


0,2 


0,4 


-0,2 


0,3 
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-0,2 


0,4 
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0,4 




26163 


pMH9 


+ 


6 


1359971 


1361974 


Regulation 


Candidate fungal specific transcription factor 
(clr2 homologue) 


10 


0,3 


1,0 


0,9 


0,4 


0,6 


0,9 


0,3 


0,9 


0,9 


0,1 


1,3 


1,5 


0,9 




77513 


pMH15 


+ 


8 


422130 


424168 


Regulatory functions 


Candidate fungal specific transcription factor 


10 


0,1 


1,0 


0,5 


-0,2 


0,1 


-0,4 


0,2 


0,7 


0,7 


0,2 


1,0 


1,0 


0,9 


6. 


22197 






8 


425315 


427046 


Glycoside hydrolase family 1 


cellb, candidate p-glucosidase 


10 


0,1 


0,5 


0,3 


0,1 


0,1 


0,0 


0,0 


0,4 


0,2 


0,2 


0,6 


0,6 


0,6 




77517 




+ 


8 


428282 


430101 


Transport 


Candidate hexose transporter, MFS 


10 


0,0 


1,0 


0,6 


-0,2 


0,7 


0,7 


-0,1 


0,8 


0,9 


0,1 


1,0 


0,9 


0,6 




107857 






10 


699048 


700756 


Hypothetical protein 


Hypothetical protein 


35 


0,1 


0,7 


0,8 


0,3 


0,7 


1,1 


-0,2 


0,5 


0,8 


-0,4 


0,8 


0,8 


-0,1 


7. 


107858 


pMH21 




10 


701330 


703726 


Regulatory functions 


Candidate fungal transcriptional regulatory 
protein 


35 


0,0 


0,8 


0,8 


0,0 


0,6 


2,2 


-0,3 


0,7 


1,0 


-0,1 


0,8 


1,0 


0,8 



Figure 2 Tightly co-expressed genomic regions with candidate regulatory genes. The expression array dataset described in [8] was searclied for 
genomic regions witli co-expressed genes. Tlie regions containing a selected candidate regulatory gene with adjacent genes belonging to the same 
Mfuzz gene expression clusters as the major cellulase and hemicellulase genes are shown. The genomic location of the genes is indicated as scaffold 
number, start and end position, and strand in the scaffold as in T. reesei database 2.0 [45]. Gene annotation is as in T. reesei database 2.0. The expression 
data of the genes in the induction dataset with cellulose, wheat and spruce material, and sophorose is shown as the expression cluster number (Mfuzz) 
and fold change of the transcript signals in the induced cultures as compared to the uninduced control cultures at the same time point. The intensity 
of the red color and blue color indicates the strength of positive and negative fold changes as compared to the uninduced control cultures, 
respectively. AV, Avicel cellulose; SO, sophorose; SP, spruce; WH, wheat straw. 
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transcription factors. The selected genes fulfilled several 
of the following criteria: induction by three or more of the 
cellulase- or hemicellulase-inducing substrates used in the 
study; co-clustering with the characterized cellulase and 
hemicellulase genes in the Mfuzz clustering of the expres- 
sion data; correlation of the expression signal with specific 
protein production rate in the chemostat study [14]; 
increased signal of the corresponding protein under good 
protein-producing conditions in proteome analysis of the 
chemostat cultures [14]; and co-localization with cellulase 
and hemicellulase genes in the genome and, preferably, 
also co-expression of the co-localized genes. In addition, 
representatives of genes with functional domains indicat- 
ing different regulatory functions and fulfilling the same 
criteria were selected. Altogether 28 genes were selected 
for further studies (Table 2). 

The expression profiles of the selected candidate regu- 
latory genes together with characterized cellulase and 



hemicellulase genes are represented as a heatmap in 
Figure 3. The heatmap shows fold change data of the 
signals in the induced cultures versus the signals in the 
uninduced cultures at the corresponding time points. 
Expression values of an additional dataset on cultures in- 
duced with a broader set of lignocellulose material (dif- 
ferently pretreated bagasse, oat spelt and birch xylans 
[8]) are also included. In the heatmap, the candidate regu- 
latory genes are divided into three major groups. Genes 
122523, 80291, 74765 and 123668 are co-expressed to- 
gether with the gene cluster containing many of the known 
hemicellulase genes (cluster 35). The genes are moderately 
induced in the presence of the majority of the substrates 
used, but especially on wheat and spruce. The second 
group of candidate regulatory genes showed modest induc- 
tion by the majority of the substrates (IDs 73792, 107858, 
70351, 121130, 123019, 62244, 55422, 76677, 121121 and 
56077). The third group clusters together with many of the 



Table 2 Putative regulatory genes chosen for further studies and the functional domains present in the encoded proteins 



Gene ID 


Construct 


InterPro ID 


Description 


108381 


pMH8 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


26163 


pMH9 


IPR001138, IPR007219 


Fungal transcriptional regulatory protein, N-terminal; Transcription factor, fungi 


121121 


pMHlO 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


70351 


pMHll 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


102499 


pMH12 


IPR001138, IPR007219 


Fungal transcriptional regulatory protein, N-terminal; Transcription factor, fungi 


62244 


pMH13 


IPR001138, IPR007219 


Fungal transcriptional regulatory protein, N-terminal; Transcription factor, fungi 


1 1 1 742 


pMH14 


IPR001138, 


Fungal transcriptional regulatory protein, N-terminal, 


77513 


pMH15 


IPR007219 


Transcription factor, fungi 


105263 


pMH16 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


112524 


pMH17 


IPR001138, IPR007219 


Fungal transcriptional regulatory protein, N-terminal; Transcription factor, fungi 


123668 


pMH18 


IPR000182 


GCN5-related N-acetyltransferase 


73792 


pMH19 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


80291 


pMH20 


IPR001138, IPRO007219 


Fungal transcriptional regulatory protein, N-terminal; Transcription factor, fungi 


107858 


pMH21 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


120120 


pMH22 


IPR000182 


GCN5-related N-acetyltransferase 


47317 


pMH24 


IPR001138, IPR007219 


Fungal transcriptional regulatory protein, N-terminal; Transcription factor, fungi 


74765 


pMH25 


IPR001487, IPR000210 


Bromodomain; BTB/POZ-like 


76677 


pMH26 


IPR001138, IPR007219 


Fungal transcriptional regulatory protein, N-terminal; Transcription factor, fungi 


55422 


pMH27 


IPR011009 


Protein kinase-like domain 


121130 


pMH28 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


122523 


pMH29 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


123019 


pMH30 


IPR000719 


Protein kinase, catalytic domain 


54703 


pMH32 


IPR007087 


Zinc finger, C2H2-type 


56077 


pMH33 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


60215 


pMH34 


IPR001138 


Fungal transcriptional regulatory protein, N-terminal 


66966 


pMH35 


IPR001680 


WD40 repeat 


64608 


pMH36 


IPR001680 


WD40 repeat 


81972 


pMH37 


IPR001251 


Cellular retinaldehyde-binding/triple function, C-terminal 



Gene IDs are as in T. reesei database 2.0. 
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Figure 3 Heat map visualization of expression data on the Icnown cellulase and hemicellulase genes and the putative regulatory genes 
in cultures induced with different lignocellulose substrates. The color key indicates tine log2 scale fold change of the transcript signals in the 
induced cultures versus the uninduced control cultures at the same time point. The genes are shown as rows and the samples as columns. The 
legend on the right shows the gene ID and the cluster membership of the gene in Mfuzz clustering of the expression datasets. Dataset 1: Induction 
experiment with Avicel cellulose (0.75%), pretreated wheat straw, pretreated spruce or sophorose; Dataset 2: Induction experiment with Avicel 
cellulose (1%), bagasse, or xylans [8]. C: CAZy gene, R: regulatory gene. The legend below indicates the lignocellulose substrate in the culture and time 
point after addition of the substrate. AV\ 0.75% Avicel cellulose; AV^, 1% Avicel cellulose; BE, enzymatically hydrolyzed bagasse material; BO, untreated 
bagasse material; BS, steam-exploded bagasse material; SO, sophorose; SP, spruce; WH, wheat straw; XB, birch xylan; XO, oat spelt xylan. 



genes in the cellulase-enriched cluster (cluster 10). This 
group includes genes induced mainly by sophorose, Avicel 
cellulose, wheat or spruce, but not with bagasse material, 
and genes hardly induced at all. Detailed transcriptional 
data of the genes is presented in Additional file 1. 



Primary screening of the effects of the candidate 
regulatory genes on the cellulase and xylanase production 
of T. reesei 

In order to investigate the effects of the putative regula- 
tory genes chosen from the data, T. reesei QM9414 
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strains overexpressing the genes were constructed. The 
genes were cloned to an expression vector under the A. 
nidulans gpdA promoter and the expression plasmids 
were transformed to QM9414. A p-glucan plate assay 
was used for preliminary evaluation of enzyme produc- 
tion by the transformants and for selection of repre- 
sentative clones from the transformation for further 
analysis. The recombinant strains were cultivated in 
shake flasks on lactose containing rich medium to 
analyze the effect of the genetic modification on growth 
and protein production. Produced cellulase and xylanase 
activities (Figure 4) were measured throughout the culti- 
vation. The growth of the strain transformed with the 
construct pMH12 was clearly defective as compared to 
the parental strain and to other recombinant strains, 
and was therefore omitted from further studies. The en- 
zyme activity produced during the cultivation of the re- 
combinant strains as compared to the activity produced 
in the cultures of the parental strain is summarized in 
Figure 4. Detailed information on production of the en- 
zymatic activities during the time course of cultivation is 
shown in Additional file 2. 

The strains overexpressing genes 77513, 74765, 80291, 
66966, 123668, 64608 and 122523 (constructs pMH15, 
pMH25, pMH20, pMH35, pMH18, pMH36 and pMH29) 
produced cellulase and/or xylanase activity over 1.5 -fold 
as compared to the parental strain in the shake flask cul- 
tures. The integrity of these seven strains and overexpres- 
sion of the genes were confirmed by southern and 
northern blot analysis, respectively (Additional files 3 and 4). 
Most of the modified strains tested had the overexpression 
construct integrated as a single copy. The strain overex- 
pressing the construct pMH35 had one to two copies ac- 
cording to the Southern hybridization. For the construct 
pMH15, both a single-copy and a double-copy transfor- 
mant were analyzed (Figures 5 and 6). Northern analysis 
showed 1.4- to 23.6-fold overexpression of the gene for 
the strains analyzed (Additional file 4), except for gene 
123668 (pMH18), which was expressed at a low level 
both in the overexpression strain and in the parental 
strain and therefore was not quantified. In addition, a 
number of the recombinant strains (transformed with 
constructs pMH8, pMH13, pMH21, pMH22, pMH24, 
pMH26 and pMH37) produced clearly less enzymatic ac- 
tivity than the parental strain. These genes were omitted 
from further studies. 

Overexpression of gene 77513 (construct pMH15) had 
the most consistent and statistically significant (t-test; 
P <0.05) positive effect on both cellulase and xylanase 
production by T, reesei. The strain produced in the initial 
screening 3- to 4-fold cellobiohydrolase 1 (CBHI) activity, 
2- to 2.5-fold endoglucanase 1 (EGI) activity and 2- to 3- 
fold combined activity as measured against the 4- 
methylumbelliferyl-p-D-lactoside (MUL) substrate (Figure 4). 



The strain also produced 2- to 3-fold more xylanase activ- 
ity as measured against the parental strain. 

The strain overexpressing gene 80291 (construct 
pMH20) produced 2.5-times more CBHI activity, 2- 
times more EGI activity and 2.5-times more total activity 
against the MUL substrate. However, the xylanase activ- 
ity was only slightly improved in this recombinant strain 
(less than 1.5 -fold) as compared to the parental strain. 
The change in the production levels by pMH20 overex- 
pression was statistically significant (t-test; P <0.05). 

The overexpression of gene 74765 (construct pMH25) 
produced the largest amount of cellulase activity as mea- 
sured volumetrically against the substrate MUL, as com- 
pared to the other recombinant strains and to the 
parental strain (almost 3.5-times more than the parental 
strain). Production of xylanase activity was also in- 
creased more than 1.5 times in the recombinant strain. 
However, T, reesei EGI (CEL7B) has been shown to have 
activity against xylans as well and thus the increase in 
xylanase activity could be partly due to the increase in 
EGI production [33]. 

Quantitative PGR of cellulase and hemicellulase genes 

Based on the preliminary enzyme activity measurements, 
strains overexpressing genes 77513, 80291 and 74765 
(constructs pMH15, pMH20 and pMH25) were selected 
for further studies. For clarity, the recombinant strains 
will be referred to by the construct names. A quantita- 
tive PGR analysis of axely bxll, xynl, xyn2, xynSy cbhl, 
cbh2, egll, bgll and xyrl was carried out. The results are 
shown as a fold change of the signals as compared to 
the parental strain QM9414 (Figure 7). For all the 
strains, the expression of cbhl, cbh2 and egll was im- 
proved as compared to the parental strain, although for 
pMH20 and pMH25 the effect was more moderate and 
was detected for pMH20 only at the 3-day time point. 
The expression of the major |3-glucosidase gene bgll was 
clearly improved by the pMH15 and pMH25 constructs 
but not by pMH20. Similarly, the expression of the three 
xylanase genes was improved by pMH15 and pMH25. 
Regarding xylanase gene expression, the overexpression 
of gene 77513 (pMH15) seemed to have most effect on 
xynSy whereas the two other candidate regulatory genes 
were more specific to xynl (only xylanase gene with im- 
proved expression by pMH20). Particularly, overexpres- 
sion of gene 74765 (pMH25) had a major effect on the 
transcription of xynl. The expression of bxll was mod- 
erately improved with pMH15 and pMH25. The clearest 
increase in axel expression was seen with pMH25. The 
expression of xyrl, which encodes the major regulator of 
cellulase and hemicellulase genes, was higher in pMH15 
than in the parental strain but was not affected in the 
other two strains. 
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Figure 4 Cellulase and xylanase production by T. reesei QM9414 recombinant strains overexpressing the candidate regulatory genes. The 

volumetric enzyme production (blue bars) and production per biomass dry weight (red bars) are shown as the fold change of the maximum amount 
of activity produced in the cultures of the recombinant strains as compared to the maximum activity produced in the cultures of the parental strain. 
The values are means of three biological replicates. Error bars show the standard error of the mean. Panels A and B show the total xylanase activity 
against birch glucuronoxylan substrate and cellulase activity against 4-methylumbelliferyl-(3-D-lactoside substrate, respectively. Panels C and D show 
the specific enzymatic activity produced by cellobiohydrolase 1 and endoglucanase 1. Detailed time course data on enzyme production in the cultures 
is shown in the Additional file 2. CBHI, cellobiohydrolase 1; EGI, endoglucanase 1; MUL, 4-methylumbelliferyl-(3-D-lactoside. 
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Figure 5 Production of cellulase activity by two different transformants overexpressing gene 77513. Transformants harboring the 
overexpression cassette as a single-copy (pMH15(S)) or as a double-copy (pMH15) were cultivated in shake flasks with lactose as a carbon source. 
Enzyme activity was measured at four different time points (3, 5, 7 and 9 days). The values are means of three biological replicates. Error bars 
show the standard error of the mean. Panels A and B show the volumetric and production per biomass dry weight of total cellulase activity 
against MUL substrate, respectively. Panels C-F show the specific enzymatic activity produced by CBHI and EGI. CBHI, cellobiohydrolase 1; EGI, 
endoglucanase 1; MUL, 4-methylumbelliferyl-(3-D-lactoside. 



Overexpression and deletion of gene 77513, designated 
as ace3 

Based on the quantitative PGR and enzyme production 
results of the recombinant strain overexpressing the 
construct pMHlS, gene 77513 was selected for more de- 
tailed studies. A recombinant strain was constructed 



from which gene 77513 was deleted (designated Del77513). 
We also analyzed enzyme production by strains having 
both one (pMH15(S)) or two (pMH15) copies of the over- 
expression cassette and in the 77513 deletion strain (all 
the constructs were confirmed by Southern and Northern 
analyses, Additional files 3 and 4). Both overexpression 
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Figure 6 Production of xylanase activity by two different transformants overexpressing gene 77513. Transformants liarboring tine overexpression 
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Figure 7 Quantitative PGR analysis of cellulase and hiemicellulase 
gene expression of strains overexpressing the constructs pMH15, 
plVlH20 and plVIH25. Panels A, B and C show the expression levels of 
the analyzed genes for strains overexpressing the constructs pMH15, 
pMH20 and pMH25, respectively. Expression levels are normalized 
against the signal of sari and are shown as a fold change as compared 
to the normalized expression level in the parental strain. RNA extracted 
after 3 days (blue bars) and 5 days (red bars) of cultivation was used as 
a template. The values are means of three biological replicates. Error 
bars show the standard error of the mean. 



strains were cultivated in parallel with the deletion strain 
and the parental strains. Produced cellulase activity 
against the MUL substrate and xylanase activity were 
measured throughout the cultivation. 

Both overexpression strains produced significantly 
(t-test; P <0.05) more total MUL activity, CBHI, EGI 
and xylanase activity as compared to the parental strain 
(Figures 5 and 6). The improvement in cellulase and 



xylanase production was higher in the double-copy strain 
than in the single-copy strain, indicating that the possible 
double-integration of the expression cassette also ampli- 
fied the positive effect of the overexpressed gene to cellu- 
lase and xylanase production. When gene 77513 was 
deleted, the production of total cellulase activity against 
the MUL substrate was abolished completely (Figure 8). 
Interestingly, production of xylanase activity decreased to 
approximately half that of the parental strain (most sig- 
nificant decrease at day 7), indicating that gene 77513 is 
not essential for the production of xylanase activity but 
does modulate it (Figure 8). 

A quantitative PGR analysis of axel, bxll, xynl, xyn2, 
xynSy cbhly chh2, egll, bgll and xyrl was carried out for 
samples collected from the cultivation of strains pMH15, 
pMH15(S) and Del77513. Due to the different parental 
strains of the overexpression strains and the deletion 
strain, the results are shown normalized with the signal 
of sari (Figures 9 and 10). The expression of cbhl^ cbh2, 
eglly bgll, xynl, xyn2, xynS and xyrl was higher in the 
overexpression strains as compared to the parental 
strain. In accordance with the enzymatic activity mea- 
surements, the increase in the gene expression was 
higher in the double-copy strain than in the single-copy 
strain. The expression of bxll was improved only in the 
double-copy strain. 

Expression of cbhl, cbh2, egll, axel and xyn3 was al- 
most undetectable in the deletion strain as compared to 
the parental strain. The expression of bxll, xynl, xyn2, 
bgll and xyrl was also lower as compared to the parental 
strain. In the light of the enzymatic activity and quantita- 
tive PGR results for the two strains overexpressing gene 
77513 and for the strain with the gene deleted, this gene 
was named activator of cellulase expression 3 (aceS), 

Discussion 

The double-lock gene regulation mechanism, in which a 
master transcription factor regulates an additional trans- 
acting regulatory factor gene together with its actual 
target genes, is well-documented in filamentous fungi. In 
particular, carbon catabolite repression has been reported 
to be mediated by such a mechanism. In the model organ- 
ism A. nidulans, the carbon catabolite repressor GREA 
regulates the ethanol utilization genes by repressing both 
the positively acting regulatory gene alcR and its target, 
alcA [34]. GREA also regulates lignocellulolytic genes by 
repressing the major activator {xlnR) as well as many of its 
target genes, for example, xlnD and xlnB [35]. Similarly, 
the major regulator of cellulolytic and xylanolytic genes in 
T, reesei {xyrl, a homologue of xlnR) is repressed by the 
carbon catabolite repressor GREl together with many 
xyrl target genes [18,36]. 

In this study, we utilized the principle of the double- 
lock mechanism to find new regulators of cellulase and 
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Figure 8 Production of cellulase and xyianase activity by the 77513 deletion strain. Del77513 was cultivated in slial<e flasl<s in lactose 
containing medium in parallel with the parental strain QM9414Amus53. Enzyme activity was measured at four different time points (3, 5, 7 and 9 
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hemicellulase genes, presuming that these regulators 
would be regulated in a similar manner as their target 
genes. We analyzed transcriptome data from T. reesei 
cultures induced with different lignocellulose-derived 
substances to search for candidate regulatory genes. This 
led to identification of 89 candidate genes that were 
co-induced with many of the known cellulase or hemicel- 
lulase genes in the presence of different lignocellulose- 
derived materials. We selected 28 genes for overexpression 
screening by taking into account supporting evidence 
from other genome-wide datasets, such as transcriptome 
and proteome analysis of chemostat cultures with different 
protein production rates [14], as well as location of the 
genes in the genome. 

Clustering of the biosynthesis genes for fungal second- 
ary metabolites together with their regulatory genes in 
the genome, as well as the regulatory cascades including 
chromatin-mediated regulation of the genomic regions, 
is relatively well-characterized in fungi (for a review, see 
[37]). Recent studies have indicated that chromatin level 
regulation also takes place in the regulation of CAZy 
genes of T, reesei. The putative methyltransferase LAEl, 
a homologue of LaeA functioning in chromatin level 
regulation of secondary metabolism in Aspergilli, has 
been shown to be involved in controlling cellulase gene 
expression in T. reesei, although the actual mechanism is 
not fully understood [38]. Furthermore, genes with sig- 
nificant up- or down-regulation during conidiation [39] 
as well as genes whose expression levels correlate with 
the specific production rate of extracellular proteins [14] 



have been shown to be non-randomly distributed in the 
T, reesei genome. Genes encoding, for example, second- 
ary metabolism proteins, CAZys, putative transporters 
and putative transcription factors have been identified 
from such genomic clusters. In addition, the protein 
families of these regulators and the protein families of 
CAZys and secondary metabolism-related enzymes have 
recently expanded in the evolution of filamentous fungi, 
{Pezizomycotina) [40]. Thus, positioning of the regula- 
tory genes in the close vicinity of their target genes 
(or other genes involved in the same process) may not 
be limited to the secondary metabolism genes, but could 
involve the genes active in lignocellulose degradation as 
well. 

The transcriptome data on the cultures induced with 
different lignocellulosic material showed genomic re- 
gions that are co-regulated in an inducer-specific man- 
ner. Of the genes that were co-expressed with the major 
cellulase and hemicellulase genes according to the 
Mfuzz clustering, 22.7% were located in enriched gen- 
omic regions (> three genes within a window of nine 
genes, with a maximal distance of five genes). Of these, 
9.1% (32 genes) were located next to each other in 
patches of three or more genes and were tightly co- 
regulated. 

In addition to the known regulatory gene for hemicel- 
lulase and cellulase genes, xyrl, nine candidate regula- 
tory genes were located in these tightly co-regulated 
regions or within close vicinity (Figure 2). Interestingly, 
four of the genes were located next to a putative sugar 
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Figure 9 Quantitative PCR analysis of cellulase and hiemicellulase gene expression by tlie strains overexpressing gene 77513 and by 
tlie strain with gene 77513 deleted. pMH15 is harboring tlie expression cassette as a double-copy and pMH15(S) as a single-copy. Panels A, B, 
C, D, E and F show the expression levels of cbhl, cbh2, egll, bgll, bxll and oxel genes, respectively. Expression levels are shown as normalized 
against the signal of sorl. RNA extracted after 3 days (blue bars) and 5 days (red bars) of cultivation was used as a template. The values are means 
of three biological replicates. Error bars show the standard error of the mean. 



transporter and/or a |3-glucosidase gene. In addition to 
the release of glucose from cellobiose by extracellular 
p-glucosidases and transport of sugars into the cells, the 
sugar transporters and p-glucosidases may have a special 
role in the onset of the CAZy gene induction. Sugar 
units derived from the complex carbon source may be 
transported inside the cells and further modified by 
intracellular p-glucosidases to form an inducing com- 
pound, such as sophorose via a transglycosylation 



reaction. The gene cel3e, located next to gene 105263 
(pMH16), encodes a predicted extracellular p-glucosidase. 
By contrast, cel3d and cellk located next to genes 121121 
(pMHlO) and 77513/ace3 (pMHlS), respectively, are 
predicted to encode intracellular enzymes. Interestingly, 
the sugar transporter genes located next to genes aceS 
and 26163 have recently been suggested to be involved in 
lactose uptake and cellulase production in lactose- 
containing media [41,42]. 
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Figure 10 Quantitative PCR analysis of xylanase and xyri gene expression by the strains overexpressing gene 77513 and by the strain 
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Co-location of a putative regulatory gene with a p- 
glucosidase gene and a transporter gene is not a unique 
feature of the T. reesei genome. For example, the homo- 
logues of 77513/ace3 (pMHlS) in A. fumigatus (AFUA_ 
016410) and in A, clavatus ( ACL A_0 1970) are accompan- 
ied by a candidate |3-glucosidase gene (AFUA_1G 16400/ 
ACLA_01980) and a candidate hexose transporter gene 
(AFUA_1G16390/ACLA_019190) next to it in the gen- 
ome. Similarly, the homologues of gene 121121 (pMHlO) 
in A, fumigatus (AFUA_7G00210) and in A, nidulans 
(ANIA_02615) are located next to a candidate hexose 
transporter gene (AFUA_7G00220/ANIA_02614), a candi- 
date major facilitator superfamily multidrug transporter 
gene (AFUA_7G2613/ANIA_02614), and a |3-glucosidase 
gene (AFUA_7G00240/ANIA_026142) [43]. 

In a recent study, it was suggested that, in N, crassa, 
the cellulase/hemicellulase regulator CLR-1 would pro- 
mote the expression of cellodextrin transporters and |3- 
glucosidase genes as well as a second regulatory gene, 
clr-2, which in turn activates cellulase genes [25]. In N, 



crassa, clr-2 is essential for cellulase production in the 
presence of Avicel cellulose [25]. In T, reesei, the 
homologue of clr-2, gene 26163 (construct pMH9), is 
located next to a co-regulated sugar transporter gene 
that has recently been described as a lactose permease 
essential for the induction of cbhl and cbh2 [42]. Over- 
expression of gene 26163 alone resulted only in a minute 
enhancement in production of cellulase and xylanase 
activity. However, no close homologue for clr-1 can be 
identified from T, reesei, suggesting an important differ- 
ence in the activation mechanisms of c/r-2/26163 and/or 
the accompanying transporter genes in N, crassa and in 
T, reesei, 

Overexpression of genes 105263 (pMH16) and 121121 
(pMHlO) did not have a significant effect on protein 
production under the conditions studied. However, over- 
expression of aceS, which is located next to a co- 
regulated |3-glucosidase gene {eel lb) and a candidate sugar 
transporter gene in its original locus, resulted in a signifi- 
cantly increased production of cellulase and xylanase 
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activity as compared to the parental strain. Deletion of the 
gene was detrimental to the production of cellulase activ- 
ity and decreased the production of xylanase activity. 
Quantitative PGR analysis of transcript levels of cellulase 
and xylanase genes supported the enzymatic activity mea- 
surements. Therefore, aceS can be considered to code for 
a novel master regulator of cellulase expression and a 
modulator of xylan degrading enzyme expression. Thus its 
role appears to be different from that of XYRl/XlnR, 
which has a major role in both xylan and cellulose de- 
gradation [18,44]. Interestingly, the Mfuzz clustering of 
aceS reflects the quantitative PGR results to some extent. 
The gene clustered together with egll, cbhl, cbh2, bgll 
and xyn3, which were most affected by aceS modifications, 
whereas axel^ bxll, xynl and xyn2 are in different 
clusters. 

Transcription of xyrl was increased in the strains 
overexpressing aceS and decreased in the deletion strain, 
indicating that the effects on the target genes observed 
could be at least partly mediated via xyrl. However, the 
deletion of aceS did not totally abolish xyrl transcrip- 
tion. Therefore, the absence of XYRl is not an explan- 
ation for the total lack of cellulase activity and gene 
expression exhibited by the deletion strain. 

Conclusions 

Gombining genome-wide data on cultures with different 
protein production properties is a useful method for 
identifying novel regulatory genes relevant for cellulase 
and xylanase production in T, reesei. Altogether, overex- 
pression of seven of the candidate regulatory genes re- 
sulted in improved (>1.5 fold) production of cellulase 
and/or xylanase activity as compared to the parental 
strain. Further studies are required to confirm the role 
of most of these genes in cellulase and hemicellulase 
gene regulation and to elucidate the actual regulatory 
mechanisms. However, our data show a positive effect of 
cellulase and/or xylanase gene expression for three of 
the candidate regulatory genes. The deletion of one of 
these genes, ace3, totally abolished cellulase expression 
and reduced xylan degrading enzyme expression, thus 
identifying it as a novel master regulator of lignocellu- 
lose degradation. Furthermore, our data reveal genomic 
regions enriched in co-regulated GAZy genes and candi- 
date regulatory genes, therefore supporting the hypoth- 
esis that chromatin-level regional regulation plays a role, 
at least in part, in the expression of GAZy genes in T. 
reesei. 

Methods 

Strains, media and culture conditions 

Escherichia coli DH5a (fhuA2 A(argF-lacZ)U169 phoA 
glnV44 080 A(lacZ)M15 gyrA96 recAl relAl endAl thi- 
1 hsdRlT) was used for propagation of the plasmids. T, 



reesei Rut-G30 (ATGG 56765, VTT-D-86271), QM6a 
(ATGG13631, VTT-D-071262 T) and QM9414 (ATGG 
26921, VTT-D-74075) were obtained from VTT Gulture 
Gollection (Espoo, Finland). Spore suspensions were pre- 
pared by cultivating the fungus on potato-dextrose plates 
(BD, Sparks, Maryland, USA ) for 5 days, after which the 
spores were harvested, suspended in a buffer containing 
0.8% NaGl, 0.025% Tween20 and 20% glycerol, filtered 
through cotton, and stored at -80°G. For DNA isolation, 
the fungus was grown in a medium containing 0.2% pro- 
teose peptone (BD), 2% glucose, 7.6 g/1 (NH4)2S04, 15.0 
g/1 KH2PO4, 2.4 mM MgS04.7H20, 4.1 mM GaGl2.H20, 
3.7 mg/1 G0GI2, 5 mg/1 FeS04.7H20, 1.4 mg/1 ZnS04. 
7H2O and 1.6 mg/1 MnS04.7H20, pH 4.8. 

Transcriptional profiling data 

Transcriptional profiling data used in the study have 
been described elsewhere [8]. In short, pre-cultures of T, 
reesei Rut-G30 were first cultivated on a minimal 
medium containing sorbitol as a carbon source. Gellu- 
lase and hemicellulase gene expression was induced by 
addition of different lignocellulose material, purified 
lignocellulose-derived polymers or specific disaccharides 
(Gultivation set 1: addition of Avicel cellulose, pretreated 
wheat straw, pretreated spruce or sophorose; Gultivation 
set 2: addition of Avicel cellulose, birch xylan, oat spelt 
xylan, or differentially pretreated bagasse). Wheat straw 
and spruce were pretreated using steam explosion. Three 
different pretreatment methods were applied to bagasse, 
including grinding of the untreated bagasse material, 
steam explosion, or steam explosion followed by enzym- 
atic treatment. Enzymatic pretreatment was done with a 
commercial cellulase and hemicellulase mixture followed 
by a protease treatment. Samples for transcriptional pro- 
filing were collected at different time points of induction 
(0, 6 or 17 h). 

Gustom-made microarray slides from RocheNimbleGen 
were used for transcriptional profiling. Sample prepar- 
ation, hybridization onto microarray slides and collection 
of raw data was carried out as instructed by Roche. The 
microarray data were analyzed using the R package Oligo 
for preprocessing of the data and the package Limma for 
identifying differentially expressed genes [28,32]. In the 
analysis of the differentially expressed genes, the signals in 
the samples of the induced cultures were compared to the 
ones in the uninduced control cultures at the correspond- 
ing time point as described in [8]. Four biological repli- 
cates of each condition and time point were analyzed. The 
cut-off used for statistical significance was P <0.01, and an 
additional cut-off for the log2 scale fold change was set as 
0.4. In addition, the expression array datasets were clus- 
tered using the R package Mfuzz [29]. Go-expressed gen- 
omic clusters were determined by enrichment of Mfuzz 
cluster members in the genomic regions. Three or more 
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gene members of the expression cluster within a window 
of nine neighboring genes and with the maximal distance 
of five genes were considered as a genomic region 
enriched with co-regulated genes. In addition, genomic re- 
gions with multiple adjacent genes belonging to the same 
expression cluster were searched for. 

The expression of the selected candidate regulatory 
genes was compared to the transcriptome and proteome 
data described in [14]. 

Construction of 7. reesei strains overexpressing candidate 
regulatory genes 

The regulatory genes were amplified by PGR using Gate- 
way compatible primers (Table 3) and the genomic DNA 
of T. reesei QM6a as a template. For the majority of the 
genes, the open reading frame (ORF) predictions used 
were as in the genome version 2.0 [45] with the following 
exceptions: the primers for genes 26163 and 64608 and 
the N-terminal primer for gene 47317were designed ac- 
cording to the ORF prediction in archived genome version 
1.0 [46], and the ORF prediction for gene 64608 was 
modified by taking into account expressed sequence tag 
sequence data. In order to construct the plasmid vectors 
for overexpression of the genes in T. reesei, the PGR frag- 
ments were inserted in the expression vector pMS204 
using the Gateway recombination system (One-Tube 
protocol) according to the manufacturers instructions 
(Invitrogen, Garlsbad, Galifornia, USA). The expression 
vector contains the hygromycin resistance gene (ZP_ 
12918108) under the A nidulans gpdA promoter [47] and 
trpC terminator [48], as well as an additional copy of the 
gpdA promoter and trpC terminator for expression of the 
gene of interest (the vector map is illustrated in Additional 
file 5). The plasmids were linearized using Hindlll, Pcil or 
Spel enzyme (New England BioLabs, Ipswich, Massachusetts, 
USA) and transformed to T. reesei QM9414 by polyethyl- 
ene glycol-mediated protoplast transformation [49]. The 
transformants were selected for hygromycin resistance on 
plates containing 150 [ig ml'^ of hygromycin B (Galbiochem, 
San Diego, Galifornia, USA). Stable transformants were 
obtained by streaking on plates containing 125 [ig ml'^ of 
hygromycin B for two successive rounds, after which 
single colonies were obtained by plating dilutions of spore 
suspensions. Integration was verified by PGR with one 
primer binding the gpdA promoter and one binding the 
ORF of the overexpressed gene (the primers used are 
listed in Table 4). The cellulase production levels of trans- 
formants from each construct were assayed on p-glucan 
plates (see below). Southern blot analysis was carried out 
for additional confirmation of the transformants showing 
improved protein production as compared to the parental 
strain. Genomic DNA was isolated using an Easy-DNA 
Kit (Invitrogen) according to manufacturers instructions. 
Southern blotting and hybridization on nitrocellulose 



filters (Hybond N, GE Healthcare, Littie Ghalfont, UK) 
were carried out according to standard procedures [50]. 
Probe fragments were PGR-amplified from the genomic 
DNA. The signals were detected using a phosphorimager 
(Typhoon imager, GE Healthcare). 

Plate assay for p-glucan hydrolysis using Congo red 
staining 

For detection of enzymatic activity against the p-glucan 
produced by fungal colonies, spores were mixed with 50°G 
top agar containing 0.1% |3-glucan (Megazyme, Bray, 
Wicklow, Ireland), 2% lactose (Fagron, Rotterdam, the 
Netherlands), 0.05% proteose peptone (BD), 7.6 g/1 (NH4) 
2SO4, 15.0 g/1 KH2PO4, 2.4 mM MgSO4.7H20, 4.1 mM 
GaGl2.H20, 3.7 mg/1 G0GI2, 5 mg/1 FeS04.7H20, 1.4 mg/1 
ZnS04.7H20, 1.6 mg/1 MnS04.7H20, 0.1% Triton TX- 
100 (Fluka, St Louis, Missouri, USA) and 3% agar Noble 
(BD), pH 5.5, and plated on solid medium (composition of 
the medium was the same as that of the top agar except 
that p-glucan was omitted and the concentration of agar 
Noble was 1.8% (w/v)). After 4 days of cultivation at 28°G, 
the plates were rinsed with 0.9% NaGl, submerged in 0.1% 
Gongo red (Merck, Darmstadt, Germany) in 1 M Tris- 
HGl (pH 9.5), and incubated for 30 min with shaking at 
100 rpm. After the incubation, the plates were washed 
with 0.9% (w/v) NaGl, and the diameter of the colonies 
and the halo around them were measured. The size of the 
halo compared to the colony size was calculated and com- 
pared to the corresponding size of the parental strain 
QM9414. 

Construction of a deletion strain 

The deletion cassette for the deletion of gene 77513 was 
constructed by Golden Gate cloning [51]. The construct 
contained the hygromycin resistance cassette (gpdA pro- 
moter, hygromycin resistance gene, trpG terminator) 
flanked by 1.523 kb and 1.024 kb fragments from the 5' 
and 3' sides of the ORF of 77513, respectively. The 5'- 
flanking region fragment was amplified by PGR with oli- 
gos S 'GCGCGGTCTCCGGGTGGCGAGGTGGGAGA 
AGGGG A-3' and 5'-GGGGGGTGTGGGATGGGAAG 
AGGAGGTGGGTGTTG-3'. The 3 '-flanking region was 
amplified by PGR with oligos 5' -GCGCGGTCTCCGA 
GAAAGGGGTGGGGGAAATGGGG-3' and S'-GCGC 
GGTCTCGGCGGTTGCGTGGGCGTT GGTGGAT-3'. 
The fragments of the marker cassette and the flanks 
were first ligated to a pBsV2 vector [52] and subse- 
quently cloned to a modified pBluescript vector (lacking 
the Bsal site). The deletion cassette was digested from 
the vector with Pmel enzyme and transformed to T. ree- 
sei QM9414Amus53 strain (QM9414 strain from which 
gene 58509 had been deleted) with high targeted integra- 
tion frequency. 



Table 3 Gateway compatible primers for the cloning of the putative regulatory genes 



\-on5iruci 


J primer 




J primer 


piVlno 


c:'r;r;r;r; ATA Ann 


rrriTAr A A a a a Anr AnnrTrnrrir ATrATnrirrirTrTTTriTrTrirTTriri 


Q'r;r;r;r; Arr ArTTTrTAr A Ar A A ArrTrrrTrrTATTrTTrrTrrrrrrrrr A 

J OOOOAv^v^Av^ 1 1 1 O 1 AL-AAoAAAvjL. I VJOVJ 1 v^v^ 1 A 1 \\3\ 1 Vjv^ 1 oL.^L.VJv^L.^^A 






imT A CAAAAAACnC ACCCTA C ATC ATnTTrT A C AC ATnTl^n 


Q'f;r;f;f;Arr ArTTTrTAr A Ar AA ArrrrrrTTTr Arr ArrrrrrTAr Arr 

J '0*a'0>oAL.^Av^ 1 1 1 ^ 1 Av^AA>oAAA>Ov_ 1 VJVJ^j | | | ^Av_VjAv_vj*a*^>o>o 1 A^A^v^ 


pivin 1 u 


'^'ccc-.c-.Ar AAcrn 


irnTArA A A AA AnrAnnrTmrATTAnAnrArA ATnnAnAT 

1 1 O 1 Av^AAAAAA>0<^AO>oL. 1 v^Ov^A^^ 1 A^A^v^Av^AA 1 ^^A^Av. 


Q ' (^(^(^r; A rr A rTTTrT A r A A r A A A rrrr r rTr rr A nrrTrT AT A r A rTT A ATr A r 

J '0'0'0>aAL.^Av^ 1 1 1 O 1 AL.AA>oAAAOv_ 1 vJOvJ 1 vJv_ 1 A^ 1 1 ^ 1 vj 1 A 1 AL-A*^ 1 1 AA 1 v^AL. 


nMHI 1 
pivin 1 1 




rrnTAr A A a a a Anr AnnrTrnmr ATr ATnnrrTrr a ATnrr a Ar 

1 1 O 1 Av^AAAAAAOV^AVJOV^ 1 1 VJV^VJv^A 1 v^A 1 1 v^v^AA 1 VJV^v^AA*^ 


Q ' r;f;r;f; Arr A rTTTrT A r A A r A A A r rTr rrTTTr AT A ATr A r A rr A rrTrTTTr 

D \J\J\J\Jr\v^v^r\v^ \ \ \ \J \ Av^AAOAAAVJV^ 1 VJOVJ 1 1 1 v^A 1 AA 1 v^AOAv^v^AVJV^ \ \ \ \ 


pivin 1 z 




rrriTAr A A a a a a nr a mrTrrirrirATrATriririri a nr a nmrr a 

1 1 >0 1 AL.AAAAAAloL.A'O'OL. 1 1 »0>^»0>^A 1 L.A 1 'o*OV]*oA*oL.Av]L.>oL.L.A 


'i'CCCCACCACTTTCTACAACAAACCTCCCTTCTACCCCTAAATCTATCTACTTCA 
J >OVDOVDAL.*^A>^ I I I vj I AL.AAoAAA>oL. 1 'OVJV] 1 1 1 A*oL.L.V] 1 AAA 1 ^^ 1 A 1 >0 1 r\\J 1 1 \Jr\ 


pivin 1 J 


CCCCAC AACnTl 


TCnTA r AAAAAACnC A r-,r-,rTTr-,rr-,r atc atccc acc-tcc a a a a c^cc a cc 

1 1 /Av^/A/A/A/A/A/A\J^/A\J\jy^ \ \ \Jv^\jv^r\ \ L.A 1 >av^L.Av^>aL.^^AAAA>a 1 v^v^Av^v. 


'^'CCCCACCACTTTCTACAACAAACCTCCCTATCACAACCCAAACCCCCCCCC 
J \J\J\J\jr\y^^r\v^ 1 1 1 >o 1 Av^AA^AAA>av^ 1 ^a^O'O 1 A 1 v^A'oAAv^v^v^AAAL.'oL.^^^o^'O'O 


pivin 1 ^ 




imT A r AAAAAACnC A CCCTTCCCC ATC ATCCCCTCCTCTTA CCCC A CTC 
1 1 ^ 1 AL-AAAAAA^v^A^ov^ 1 1 ^v^^v^A 1 y^r\ \ \ v^y^ \ r\^\j\j'<^r\y^ \ 


Q'f;f;f;f;ArrA rTTTrT A r A A r A A A r rTr r rTrTT A r A AT A rT A A A CTCTTCCC 
J \J\J\J\jr\v^v^r\v^ 1 1 1 ^ 1 Av^AA^AAA>ov^ 1 1 >o 1 1 A^AA 1 A^^ 1 AAA^^ 1 1 1 v^o^^ 


pivin 1 J 




imTAr A A A A AACC ACCCTTCCCC ATC ATCCTCCCCTACTCCCCCCTCT 
1 1 ^ 1 AL-AAAAAAov^A^^v^ 1 1 ^v^^v^A 1 v^A 1 1 ^v^O^^ 1 A^^ 1 v^v^V-V^v^O 1 1 


Q ' A r r A rTTTrT A r A A r A A A r rTr r rTTTT A r r r A A r A A r r rT A rTr r A 

J ^>O^^Av^v^Av^ 1 1 1 ^ 1 Av^AAoAAA^v^ 1 1 1 1 1 A^V-V^AAv^AA^^O^ 1 A^ 1 ^^A 


pivin 1 o 


i^'nCnCnrnACAAr-rn 


rrnTAr A A a a a accacccttccccatcatcaccacctcccaccattcca 

\ \ \J \ Av^AAAAAAvjv^Avj»Ov^ 1 1 VJv_VJv_A 1 v_A 1 »OA*^»^A>0*^ 1 \J^'^r\VJ^r\ \ \ v^v^A 


Q'r;(^f;f;ArrArTTTrTAr AAr AAArrTrrrTrrrArrTrAArr ArrrrrrTAT 

D \J\J\J\Jr\v^v^r\v^ \ \ \ \J \ A^^AAOAAAvjv^ 1 vj»OVJ 1 v_ 1 v^Avjvj 1 »oAA»0>oA*0>OVJv^vj>0 1 A 1 


pivin 1 / 


q ' riririri A r A A riTI 
J \j\3\j\3n.y^n.n.\3 1 


rrriTAr A A a a a Arr Arrrirrrrr ATr ATrrrrrr ATrrrrrrrTrrrr 

1 1 >o 1 AL.AAAAAA>oL.A*ov]L. 1 1 OL.'oL.A 1 L.A 1 *ovaL. 1 \J\jr\ 1 1 1 oL. 1 o 


Q ' r;rir;ri A r r A rTTTrT A r A A r A A A r rrr r rT A r A r ATTr ATrr rrr r r rr r A r 

J >OVDOVDAL.L.AL. I I I »J I Av^AAoAAA>aL. 1 'O'O'O 1 AL.AL.A 1 1 L.A 1 1 V3L.V3L.L.L.AV] 


pivin 1 o 




rrrT A c aaaaaacc a cccttcccc atc ATr r rTrTr rirrTr rTr r r a r 

1 1 ^ 1 Av^AAAAAA^v^A^^v^ 1 1 ^v^^v^A 1 y^r\ \ ^v.*^ 1 1 1 1 ^ 1 1 v^v^v^A^ 


Q'f;f;f;f;ArrA rTTTrT A r A A r A A A r rTr r rTrTT A ATTr A r r A rrrr rTr rrr 

J ^>O^^Av^v^Av^ 1 1 1 ^ 1 Av^AAoAAA^v^ 1 1 1 1 AA 1 1 >oA>oL.A^v^^>oL. 1 


pivin 1 ^ 




rrrT A r a a a a a a cc a cccttcccc atc atcc ATrTr r a ATr nrrr a r a 

1 1 o 1 Av^AAAAAA^v^A^ov^ 1 1 ^v^^v^A 1 v^A 1 \j\jr\ \ \ \jv^r\r\ \ ^'^^ \ \ \ \jr\v^r\ 


Q ' A r r A rTTTrT A r A A r A A A r rTr r rTr rT A r A r A r r rTTTr r r A A A A A r 

J O^^^Av^v^Av^ 1 1 1 ^ 1 Av^AAoAAA^v^ 1 1 v^v^ 1 /Ay^r\\jr\^\Jv^ \ \ \ v^v^oAAAAA^ 


pivinzu 




rrrTAr A A a a a Arr Arrrirrrrr ATr ATrrrrrrrr a Arrr Ar Ar a Ar 

1 1 1 AL.AAAAAA'O^^AVJ'O'^ 1 1 VJv_VJv_A 1 ^A 1 VJVJVJ^v_vo>o*^AA»^>^>oA>oA»^AAv^ 


Q'f;f;f;f;Arr ArTTTrTAr A Ar A A ArrrrrrTATTATATA A Arrrrrr ATr A AT 

J ^>aVD>aAL.L.AL. 1 1 1 >0 1 Av^AA'OAAA'O'^ 1 VJVJ^o | A 1 1 A 1 A 1 AAAv_vjVJvoVJv_A 1 v_AA 1 


pivinz 1 


i^'nnnnArAArrn 


rxrTAr A a a a a Arr Armrrrrr ATr ATmrrr Arrr Arrr r ATr 

1 1 vj 1 A»^AAAAAAvj*^A»0'Ov^ 1 1 vJv^vJv_A 1 v_A 1 vjvj 1 1 v_vJA»0'Ov^A*^v^»OV]A 1 


Q'f:,/^^:,^:, Arr ArTTTrTAr A Ar A A ArrrrrrTTrTA Ar A A Ar ATrTTrrr ArrTr A 

D \J\J\J\Jr\v^v^r\v^ \ \ \ \J \ Av^AAVJAAAvjv^ 1 OOvJ 1 1 v_ 1 AA*OAAAv_A 1 v_ 1 1 v_v^vJAv_v_ 1 \Jr\ 


pivinzz 


c:'r;r;r;r; ATA Ann 


rTrT A r A A A A A A CC ACCCTTCCCC ATC ATCTCCCCCC A A ATCTCC 
\ \\3\ AL-AAAAAAoL-AVDOL. 1 1 VDv^»jL.A I L.A 1 1 L.L.L.VDL.L.AAA 1 1 


'i'CCCCACC ACTTTCT AC A AC A A ACCTCCCTCTTACTC CCTCCTC AT ACTTCT 
J O^JOVDAL-L-AL. I I I O I Av^AAoAAAvjv^ 1 VJOVJ 1 L. 1 1 Av. 1 v^VDO 1 VDL. I oA 1 AL. 1 1 L. 1 


pivinz^ 




rTrT A r A A A A A A CC A CCCTTCCCC ATC ATrTr rTr AAA r r rTTr A ccc 

\ \\J\ Av^AAAAAAov^A^^v^ 1 1 ^L.>oL.A 1 v^A 1 ^ 1 1 v^AAA^^^v^ 1 1 v^A^^v^^ 


Q'(^(^(^(^Arr ArTTTrTAr A Ar A A ArrrrrrTr rTArrrr A A ATrrrrrATATTr 

J >0>0>0>oAv^L.AL. 1 1 1 ^ 1 Av_AAoAAAOv_ 1 OOvj 1 1 A^v^v^v^AAA 1 O^v^v^v^A 1 A 1 1 ^ 


pivinz J 




rTrT A c AAAAAACC A CCCTTCCCC ATC ATC A rTTrTr A A rrrrr rTrTr 

1 1 ^ 1 AL.AAAAAA>oL.A>aov^ 1 1 ^v^^v^A 1 v^A 1 \jr\y^ 1 1 1 ^AA^v^v^L.^^ 1^1^ 


Q ' (^(^(^r; A r r A rTTTrT Ar A A r A A A r rrr r rTrrT A rTr r r r rTrTTr r r rTr 

J >0'0*0'OAv^v^Av^ 1 1 1 ^ 1 Av_AAoAAAvJv_ 1 1 1 r\v^ 1 ^^^^^ 1 1 1 v^VJv^v^ 1 


pivinzD 




rTrT A r A A A A A A r r A r r rTTr rrr ATr ATr rrrr A r A rrrrr A rTr 

1 1 o 1 AL.AAAAAAvjL.A>a^v^ 1 1 ^v^^v^A 1 y^r\ \ ^^v^v^^Av^Av^v^v^L.>oAv^ 1 


Q'(^r;(^r;Arr ArTTTrTAr A Ar A A Ar rrrr rTTTTAr A Ar rrrr rrTr rTrTr r 

J '0>a>0>aAv^v^AL. 1 1 1 ^ 1 A»^AA>oAAA>0>^ 1 >0>0>a 1 1 1 1 A>oAA>Ov^v^v^Ov^v^ 1 1 1 


nK/1 1-197 

pivinz/ 


c:'r;r;r;r; ATA Am 


rTrT A r A A A A A A r r A r r rTTr rr r a r a ATr r rTr rrr rrr rrT 

1 1 O 1 AL-AAAAAAoL-AvJOv^ 1 1 VJv^VJv^Av^AA 1 ov^v^ 1 1 


Q ' r;r;r;r; Arr A rTTTrT Ar A A r A A A r rTr rrTTTr ATTr ATr r rrrr A r A A r A A 

J ovdovjAv^v^Av^ I I I o I Av^AAoAAAov^ 1 vjovd I 1 1 L.A 1 1 v^A 1 L.oL.^^L.AoAAv^AA 


pMH28 


5'GGGGACAAG^ 


TOTACAAAAAAGCAGGC^GCGCATCATGAACATGACGACAACGCT 


3 ' GGGG ACC ACnrGTAC A AG A A AGCTGGGTCTATCTATA ACTTGGTA^^GC 


pMH29 


5'GGGGACAAG^ 


TOTACAAAAAAGCAGGCTOCGCATCATGGTAGCACATAGTCTACCCTCT 


3'GGGGACCACmGTACAAGAAAGCTGGGTCTCATATCGGCACCATGTCG 


pMH30 


5'GGGGACAAG^ 


^GTACAAAAAAGCAGGCTOCGCATCATGCTCATCAACAACCTCGATCC 


3'GGGGACCACmGTACAAGAAAGCTGGGTGTCAGACGAAACGCCGCCAG 


pMH32 


5'GGGGACAAG^ 


TOTACAAAAAAGCAGGCTOCGCATCATGTOCCGTACGGTGCC 


3'GGGGACCACmGTACAAGAAAGCTGGGTCTCACATACCCATAATCATOCTC 


pMH33 


5'GGGGACAAG^ 


TOTACAAAAAAGCAGGCTOCGCATCATGACATCAATAACGCATCCCTC 


3'GGGGACCACmGTACAAGAAAGCTGGGTOCCTOATCTCCTGGTGGAAT 


pMH34 


5'GGGGACAAG^ 


TOTACAAAAAAGCAGGCTOCGCATCATGTCGGACAGTACTCACTCG 


3'GGGGACCACmGTACAAGAAAGCTGGGTAmCGACAATGGCAAGATCCT 


pMH35 


5'GGGGACAAG^ 


^GTACAAAAAAGCAGGCTOCGCATCATGGCCAAGAAGGCGCGTC 


3'GGGGACCACmGTACAAGAAAGCTGGGTGCTAGGCGCCGTOACGACTC 


pMH36 


5'GGGGACAAG^ 


TOTACAAAAAAGCAGGCTOCGCATCATGTCCAGATOTGTCCGCT 


3'GGGGACCACmGTACAAGAAAGCTGGGTGCATCAATAGGCCGTATCAGAG 


pMH37 


5'GGGGACAAG^ 


TOTACAAAAAAGCAGGCTOCGCATCATGACGGCCGAGTACGAAG 


3'GGGGACCACmGTACAAGAAAGCTGGGTCTCGTOATGCAGCAGCAG 
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Table 4 Primers for the PCR screening of the overexpression 
strains 



Primsr 




|jy |JUrA 


A HTA Ar^rf; A A(^(^ A(^ A ATG 


pivinon 1 


Ar Arr,r,rTTrTTATATrTrr, Arr 




A 1 1 ^ 1 v^*JA I *J 1 OVJV^ 1 1 


nMHI DPI 
pivin 1 un 1 


\^ \ O^OAOAO^AVJV. 1 r\\J\Jr\\J\^ 


nMHI 1 R7 
pivin 1 1 nz 


rc-iTCc-, ATTrr,rr,rTTr, a a r a 

1 \^K3r\ \ \ V^\3\^\3V^ \ \ VjAAv^A 


l^ivii 1 1 z.r\z. 


GATGCACGCCGCCATCGAC-J' 


pivin 1 jnz 


Trr,TTrTrrTrr,TAr;ATTrAr, 

1 1 1 1 v^v^ 1 v^VD 1 AOA 1 1 ^r\\J 


pivin 1 'H-n 1 


(^rTr,r,rTrTTrTrrrTr acac 


nMHI SR^ 
[jivin 1 J no 


Tr, A r,T ATA r,rr,r,rTr, ArTTr,Trr, 


nMHI (^R^ 




nK/lH 1 7R1 
pivin 1 / n 1 


CTCn A r-,r-,r-,rTr-,T A r, a rcr a ctc 


nK/lHI RR1 
pivin 1 on 1 


TT A r A r, A r,r,Tr, Ar, A riTTrrrT 


nMHI QRl 




nMH7nR^ 


Trr, A A ATr,r A r,rr, ATA r. 


nMH71 Rl 
pivii iz 1 r\ 1 


TGGTTCTG G ATC A CTC CTC A 


nK/lH77R1 
pivinzzn 1 


TTrr,TrrTrrr;TrTTf; Ar,r a 


nK/lH74R7 
pivinz'H-nz 


CTC AC CTC CTC CT AC AC ACTA 

^ 1 ^Av^v^ 1 v^VJ 1 v^VJ 1 Av^Av^A*^ 1 A 


nMH7SRl 
pivinz J n 1 


ATr,rr,r,TTr, A rrrr, A r Ar, AT 


nMH7fiR7 
pivinzunz 


r,r,TTr, a ctctcc ATr,TTr,r, a 


nMH77R1 
pivii IZ / n 1 


ATfTTri A rGTrrTTGTrG at 


pMH28Rl 


GCGAATCGACCAGATCGTGT 


pMH29Rl 


GTCCTOCACCGCmCACG 


pMH30R2 


GTAGAAGCGCAATGCGGTGG 


pMH32R2 


CAGATGCACGTCTCCAGAT 


pMH33R1 


TCTGGTCTCGATOCTCGTG 


pMH34Rl 


CATCAGCCTCGTCTCCAGCA 


pMH35R3 


CATCATCAATGTCCTCGAAG 


pMH36Rl 


GTCAGGATAGCGCCTGTCTG 


pMH37Rl 


GTCCGGTACAGCGTGTCAAT 



Primer named pgpdA was used in combination with the gene specific primers. 



Cultivation of 7. reesei in shalce flaslcs 

T. reesei QM9414 and representative clones from trans- 
formations of each of the regulatory factor constructs 
were cultivated on medium containing 4% lactose 
(Fagron), 2% spent grain extract, 7.6 g/1 (NH4)2S04, 15.0 
g/1 KH2PO4, 2.4 mM MgS04.7H20, 4.1 mM CaCl2.H20, 
3.7 mg/1 C0CI2, 5 mg/1 FeS04.7H20, 1.4 mg/1 ZnS04. 
7H2O and 1.6 mg/1 MnS04.7H20, pH adjusted to 5.2 with 
KOH. The culture medium was inoculated with 2 x 10^ 
spores per 200 ml of the medium, and grown at 28°C in 
conical flasks with shaking at 250 rpm for 10 days. The 
strains were cultivated in triplicate. Samples were col- 
lected after 3, 5, 7 and 9 or 10 days of cultivation. For 
RNA isolation, mycelium was collected by filtering the 
samples, and the mycelium was washed with equal volume 
of 0.7% NaCl, frozen immediately in liquid nitrogen and 



stored at -80°C. For measurement of the biomass dry 
weight, the filtered and washed mycelium samples were 
dried at 105°C to constant weight (24 h). Filtered culture 
media was used for enzymatic assays and for measuring pH. 

Enzyme assays 

Cellulase activity against the MUL substrate, CBHI and 
EGI activity was determined by detecting the fluorescent 
hydrolysis product methylumbelliferone released from 
the substrate MUL ( Sigma- Aldrich, Steinheim, Germany) 
as described in [53]. The combined activity of EGI and 
CBHI was measured by inhibiting p-glucosidase activity 
with glucose. EGI activity was measured by adding cellobi- 
ose to inhibit CBHI and glucose to inhibit |3-glucosidase. 
CBHI activity was deduced by subtracting EGI activity 
from the combined CBHI and EGI activity. Endo-|3-1.4- 
xylanase activity was assayed using 1.0% birch glucuro- 
noxylan as a substrate [54]. The released reducing sugars 
were detected with 2-hydroxy-3,5-dinitrobenzoic acid. 
Pure xylose (Sigma- Aldrich) was used as a standard. 

Northern analysis 

Total RNA was isolated from the mycelium samples 
using the Trizol™ Reagent (Gibco BRL, Carlsbad, California, 
USA), essentially according to manufacturer s instructions. 
Northern blotting and hybridization on nitrocellulose fil- 
ters (Hybond N, GE Healthcare) were carried out accord- 
ing to standard procedures [50]. Fragments of the genes 
to be analyzed were PCR amplified from the genomic 
DNA and used as probes in the Northern analysis. The 
signals in the northern blots were quantified using a phos- 
phor imager (Typhoon imager, GE Healthcare), and the 
signals were normalized with those of actin. 

Quantitative PCR 

Total RNA was isolated from the mycelial samples of 
three parallel cultivations collected at the cultivation time 



Table 5 Primers for the quantitative PCR analysis 



Gene 


5' primer 


3' primer 


cbhl 


GCGGATCCTCmCTCAG 


ATGTOGCGTAGTAATCATCC 


cbh2 


TCCTGGTOTOAGCCTGAC 


GCAACAmGGAAGGTOAG 


eg 11 


GTCTACTACGAACTCGAC 


GTAGTAGTCGTOCTATACTG 


bgll 


GCCTCCAAGATCAGCTATCC 


ACCTCCTCACCGATGAACTG 


xynl 


AAACTACCAAACTGGCGG 


TOATGGGAGCAGAAGATCC 


xyn2 


CGGCTAOTCTACTCGTACTG 


TOATGACOTGTOTOGTG 


xyn3 


TACAAGGGCAAGATOGTG 


ACTGGOTCCAATACCGT 


axel 


TAAAGCAGCAATOTCATGG 


GCAGTAAGAOTGATOTGG 


bxll 


GTCACTOTCCAAGCTCAG 


ATCGTOCCTOTCTCCCA 


xyrl 


GAGTATCAGCGCAACmAGCA 


CATCGGTATAGTGCAAGAAGCTC 


sari 


TCTCCACCCTAOTCTGAG 


OTGTOCCCAGGATGAC 
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points 3 and 5 days using Trizol™ Reagent (Gibco BRL), 
essentially according to manufacturer s instructions. Single 
stranded cDNA was synthesized using a QuantiTect 
Reverse Transcription Kit (Qiagen, Hilden, Germany), 
with 1.5 [ig of total RNA as a template. The cDNA sam- 
ples were diluted 1:10 to 1:50 and quantitative PGR reac- 
tions of two technical replicates were performed using a 
LightGycler 480 SYBR Green I Master Kit (Roche, 
Mannheim, Germany) according to the instructions of the 
manufacturer. The instrument used for quantitative PGR 
was Light Gycler 480 II and the results were analyzed with 
LightGycler 480 Software release 1.5.0. (version 1.5.0.39) 
using sari signal for normalization. The primers used in 
the quantitative PGR are listed in Table 5. 

Additional files 



Additional file 1: Transcriptional profiling data of the putative 
regulatory genes. Gene identifiers are as in T. reesei database version 
2.0. Functional Interpro domain identifiers are as in InterPro database. 
Fold changes (log2 scale compared to uninduced control culture at a 
corresponding time point), signal intensities (log2 scale) and significance 
test (R package limma, P <0.01, log2 fold change >0.4) are shown for the 
genes. 1 indicates induction and -1 repression. The intensity of the red 
color and blue color indicates the strength of positive and negative fold 
changes, respectively. Color scales of yellow, red and green indicate 
different intensities of signals, red represents the strongest signals and 
green the weakest signals. AVI, 1% Avicel cellulose; AV0.75, 0.75% Avicel 
cellulose; BE, enzymatically hydrolyzed steam-exploded bagasse; BO, 
ground bagasse; BS, steam-exploded bagasse; SO, sophorose; SP, 
steam-exploded spruce; WH, steam-exploded wheat straw; XB, birch 
xylan; XO, oat spelt xylan. 

Additional file 2: Production of total proteins and cellulase and 
xylanase activity by the recombinant strains at different time points 
of the cultivation. Results are shown for each strain volumetrically (nkat/l) 
and per biomass dry weight (nkat/g). The values are means of three 
biological replicates. Error bars show the standard error of the mean. BGL, 
(3-glucosidase activity; CBHI, cellobiohydrolase activity; EGI, endoglucanase 
activity; MUL, total cellulase activity measured against the substrate 
4-methylumbelliferyl-[3-D-lactoside; XYN, xylanase activity. 

Additional file 3: Results of Southern hybridizations. Position of the 
molecular weight size marker is shown as kb on the left. The restriction 
enzymes used for the digestion in the analysis are indicated by the 
letters: A, Ncol + BstXI; B, Bglll; C, Spel + Bell; D, Clal + BamHI; E, Sad; F, 
Nael; G, Clal + Xbal; H, SnaBI + Xbal; I, StuI; J, Sad; K, StuI; L, XmnI; M, BstEII; 
N, Sspl; 0, StuI; P, Sspl; Q, StuI. For Del77513 strain two different probes 
were used: hygromycin selection marker (hph) open reading frame (N 
and 0) and fragment of the gene 77513 open reading frame (P and Q). 

Additional file 4: Results of Northern hybridizations. Northern blot 
analysis of the expression of the candidate regulatory genes in the 
recombinant strains. (A) mRNA signals of genes 123668, 80291, 74765, 
122523, 66966 and 64608 in cultures of the strains harboring the 
corresponding overexpression cassettes pMH18, pMH20, pMH25, pMH29, 
pMH35 and pMH36, respectively, are shown on the top. The mobility of 
the transcript encoded by the overexpression construct is indicated by 
an arrow in the blot. Samples collected after 3 days of cultivation (two 
biological replicates) were analyzed. The northern hybridization signal of 
actin and staining of total RNA with the SYBR Green II in the same gels 
are shown below each of the northern blots, as indicated. (B) mRNA 
signals of gene 77513 in cultures of overexpression strains pMH15 and 
pMH15(S), and in the Del77513 strain. Samples collected after 3 and 5 
days of cultivation (two biological replicates) were analyzed. The northern 
hybridization signal of actin and staining of total RNA with the SYBR 
Green II in the same gel are shown below, as indicated. (C) Signal fold 



change of the northern signals in the recombinant strain versus the 
control strain. Signal intensities were normalized using the actin signal. 

Additional file 5: pl\/IS204 vector with hygromycin resistance gene 
and gateway cloning cassette under gpdA promoter and trpC 
terminator. AmpR, ampicillin resistance gene; attRl/attR2, att sites for 
recombination; ccdB, ccdB gene for negative selection; CmR, 
chloramphenicol resistance gene; hph, hygromycin resistance gene; MCS, 
multiple cloning site; ORI, origin of replication. 
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